cgctctagcc cggtgggaag ctttcatcca gaaca atg aat ttc ata aag gac 53 

Met Asn Phe lie Lys Asp 
1 5 

aat age cga gec ctt att caa aga atg gga atg act gtt ata aag caa 101 
Asn Ser Arg Ala Leu He Gin Arg Met Gly Met Thr Val He Lys Gin 
10 15 20 

ate aca gat gac eta ttt gta tgg aat gtt ctg aat cgc gaa gaa gta 14 9 

He Thr Asp Asp Leu Phe Val Trp Asn Val Leu Asn Arg Glu Glu Val 
25 30 35 

aac ate att tgc tgc gag aag gtg gag cag gat get get aga ggg ate 197 
Asn He He Cys Cys Glu Lys Val Glu Gin Asp Ala Ala Arg Gly He 
40 45 50 

att cac atg att ttg aaa aag ggt tea gag tec tgt aac etc ttt ctt 245 
He His Met He Leu Lys Lys Gly Ser Glu Ser Cys Asn Leu Phe Leu 
55 60 65 70 

aaa tec ctt aag gag tgg aac tat cct eta ttt cag gac ttg aat gga 293 
Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu Phe Gin Asp Leu Asn Gly 
75 80 85 

caa agt ctt ttt cat cag aca tea gaa gga gac ttg gac gat ttg get 341 
Gin Ser Leu Phe. His Gin Thr Ser Glu Gly Asp Leu Asp Asp Leu Ala 
90 95 100 

cag gat tta aag gac ttg tac cat ace cca tct ttt ctg aac ttt tat 389 
Gin Asp Leu Lys Asp Leu Tyr His Thr Pro Ser Phe Leu Asn Phe Tyr 
105 110 115 

ccc ctt ggt gaa gat att gac att att ttt aac ttg aaa age ace ttc 437 
Pro Leu Gly Glu Asp He Asp He He Phe Asn Leu Lys Ser Thr Phe 
120 125 130 

aca gaa cct gtc ctg tgg agg aag gac caa cac cat cac cgc gtg gag 485 
Thr Glu Pro Val Leu Trp Arg Lys Asp Gin His His His Arg Val Glu 
135 140 145 150 

cag ctg acc ctg aat ggc etc ctg cag get ctt cag age ccc tgc ate 533 
Gin Leu Thr Leu Asn Gly Leu Leu Gin Ala Leu Gin Ser Pro Cys He 
155 160 165 

att gaa ggg gaa tct ggc aaa ggc aag tec act ctg ctg cag cgc att 581 
He Glu Gly Glu Ser Gly Lys Gly Lys Ser Thr Leu Leu Gin Arg He 
170 175 180 

gee atg etc tgg ggc tec gga aag tgc aag get ctg acc aag ttc aaa 629 
Ala Met Leu Trp Gly Ser Gly Lys Cys Lys Ala Leu Thr Lys Phe Lys 
185 190 195 

ttc gtc ttc ttc etc cgt etc age agg gec cag ggt gga ctt ttt gaa 677 
Phe Val Phe Phe Leu Arg Leu Ser Arg Ala Gin Gly Gly Leu Phe Glu 
200 205 210 



F/9.IA 



acc etc tgt gat caa etc ctg gat ata cct ggc aca ate agg aag cag 725 
Thr Leu Cys Asp Gin Leu Leu Asp lie Pro Gly Thr He Arg Lys Gin 
215 220 225 230 

aca ttc atg gee atg ctg ctg aag ctg egg cag agg gtt ctt ttc ctt 773 
Thr Phe Met Ala Met Leu Leu Lys Leu Arg Gin Arg Val Leu Phe Leu 
235 240 245 

ctt gat ggc tac aat gaa ttc aag ccc cag aac tgc cca gaa ate gaa 821 
Leu Asp Gly Tyr Asn Glu Phe Lys Pro Gin Asn Cys Pro Glu He Glu 
250 255 260 

gec ctg ata aag gaa aac cac cgc ttc aag aac atg gtc ate gtc acc 869 
Ala Leu He Lys Glu Asn His Arg Phe Lys Asn Met Val He Val Thr 
265 270 275 

act acc act gag tgc ctg agg cac ata egg cag ttt ggt gec ctg act 917 
Thr Thr Thr Glu Cys Leu Arg His He Arg Gin Phe Gly Ala Leu Thr 
280 285 290 

get gag gtg ggg gat atg aca gaa gac age gec cag get etc ate cga 965 
Ala Glu Val Gly Asp Met Thr Glu Asp Ser Ala Gin Ala Leu lie Arg 
295 300 305 310 

gaa gtg ctg ate aag gag ctt get gaa ggc ttg ttg etc caa att cag 1013 
Glu Val Leu He. Lys Glu Leu Ala Glu Gly Leu Leu Leu Gin He Gin 
315 320 325 

aaa tec agg tgc ttg agg aat etc atg aag acc cct etc ttt gtg gtc 1061 
Lys Ser Arg Cys Leu Arg Asn Leu Met Lys Thr Pro Leu PJie Val Val 
330 335 340 

ate act tgt gca ate cag atg ggt gaa agt gag ttc cac tct cac aca 1109 
He Thr Cys Ala He Gin Met Gly Glu Ser Glu Phe His Ser His Thr 
345 350 355 

caa aca acg ctg ttc cat acc ttc tat gat ctg ttg ata cag aaa aac 1157 
Gin Thr Thr Leu Phe His Thr Phe Tyr Asp Leu Leu He Gin Lys Asn 
360 365 370 

aaa cac aaa cat aaa ggt gtg get gca agt gac ttc att egg age ctg 1205 
Lys His Lys His Lys Gly Val Ala Ala Ser Asp Phe He Arg Ser Leu 
375 380 385 390 

gac cac tgt gga gac eta get ctg gag ggt gtg ttc tec cac aag ttt 1253 
Asp His Cys Gly Asp Leu Ala Leu Glu Gly Val Phe Ser His Lys Phe 
395 400 405 

gat ttc gaa ctg cag gat gtg tec age gtg aat gag gat gtc ctg ctg 1301 
Asp Phe Glu Leu Gin Asp Val Ser Ser Val Asn Glu Asp Val Leu Leu 
410 415 420 

aca act ggg etc etc tgt aaa tat aca get caa agg ttc aag cca aag 134 9 

Thr Thr Gly Leu Leu Cys Lys Tyr Thr Ala Gin Arg Phe Lys Pro Lys 
425 430 435 



Fig. IB 



t # 



tat aaa ttc ttt cac aag tea ttc cag gag tac aca gca gga cga aga 1397 
Tyr Lys Phe Phe His Lys Ser Phe Gin Glu Tyr Thr Ala Gly Arg Arg 
440 445 450 

etc age agt tta ttg acg tct cat gag cca gag gag gtg acc aag ggg 1445 
Leu Ser Ser Leu Leu Thr Ser His Glu Pro Glu Glu Val Thr Lys Gly 
455 460 465 470 

aat ggt tac ttg cag aaa atg gtt tec att teg gac att aca tec act 1493 
Asn Gly Tyr Leu Gin Lys Met Val Ser He Ser Asp He Thr Ser Thr 
475 480 485 

tat age age ctg etc egg tac acc tgt ggg tea tct gtg gaa gee acc 1541 
Tyr Ser Ser Leu Leu Arg Tyr Thr Cys Gly Ser Ser Val Glu Ala Thr 
490 495 500 

agg get gtt atg aag cac etc gca gca gtg tat caa cac ggc tgc ctt 1589 
Arg Ala Val Met Lys His Leu Ala Ala Val Tyr Gin His Gly Cys Leu 
505 510 515 

etc gga ctt tec ate gee aag agg cct etc tgg aga cag gaa tct ttg 1637 
Leu Gly Leu Ser He Ala Lys Arg Pro Leu Trp Arg Gin Glu Ser Leu 
520 525 530 

caa agt gtg aaa aac acc act gag caa gaa att ctg aaa gee ata aac 1685 
Gin Ser Val Lys. Asn Thr Thr Glu Gin Glu He Leu Lys Ala He Asn 
535 540 545 550 

ate aat tec ttt gta gag tgt ggc ate cat tta tat caa gag agt aca 1733 
He Asn Ser Phe Val Glu Cys Gly He His Leu Tyr Gin Glu Ser Thr 
555 560 565 

tec aaa tea gee ctg age caa gaa ttt gaa get ttc ttt caa ggt aaa 1781 
Ser Lys Ser Ala Leu Ser Gin Glu Phe Glu Ala Phe Phe Gin Gly Lys 
570 575 580 

age tta tat ate aac tea ggg aac ate ccc gat tac tta ttt gac ttc 1829 
Ser Leu Tyr He Asn Ser Gly Asn He Pro Asp Tyr Leu Phe Asp Phe 
585 590 595 

ttt gaa cat ttg ccc aat tgt gca agt get ctg gac ttc att aaa ctg 1877 
Phe Glu His Leu Pro Asn Cys Ala Ser Ala Leu Asp Phe He Lys Leu 
600 605 610 

gac ttt tat ggg gga get atg get tea tgg gaa aag get gca gaa gac 1925 
Asp Phe Tyr Gly Gly Ala Met Ala Ser Trp Glu Lys Ala Ala Glu Asp 
615 620 625 630 

aca ggt gga ate cac atg gaa gag gee cca gaa acc tac att ccc age 1973 
Thr Gly Gly He His Met Glu Glu Ala Pro Glu Thr Tyr He Pro Ser 
635 640 645 

agg get gta tct ttg ttc ttc aac tgg aag cag gaa ttc agg act ctg 2021 
Arg Ala Val Ser Leu Phe Phe Asn Trp Lys Gin Glu Phe Arg Thr Leu 
650 655 660 



Re, \c 



gag gtc aca etc egg gat ttc age aag ttg aat aag caa gat ate aca 2069 
Glu Val Thr Leu Arg Asp Phe Ser Lys Leu Asn Lys Gin Asp lie Thr 
665 670 675 

tat ctg ggg aaa ata ttc age tct gee aca age etc agg ctg caa ata 2117 
Tyr Leu Gly Lys lie Phe Ser Ser Ala Thr Ser Leu Arg Leu Gin lie 
680 685 690 

aag aga tgt get ggt gtg get gga age etc agt ttg gtc etc age ace 2165 
Lys Arg Cys Ala Gly Val Ala Gly Ser Leu Ser Leu Val Leu Ser Thr 
695 700 705 710 

tgt aag aac att tat tct etc atg gtg gaa gec agt ccc etc acc ata 2213 
Cys Lys Asn lie Tyr Ser Leu Met Val Glu Ala Ser Pro Leu Thr lie 
715 720 725 

gaa gat gag agg cac ate aca tct gta aca aac ctg aaa acc ttg agt 2261 
Glu Asp Glu Arg His lie Thr Ser Val Thr Asn Leu Lys Thr Leu Ser 
730 735 740 

att cat gac eta cag aat caa egg ctg ccg ggt ggt ctg act gac age 2309 
lie His Asp Leu Gin Asn Gin Arg Leu Pro Gly Gly Leu Thr Asp Ser 
745 750 755 

ttg ggt aac ttg aag aac ctt aca aag etc ata atg gat aac ata aag 2357 
Leu Gly Asn Leu. Lys Asn Leu Thr Lys Leu lie Met Asp Asn lie Lys 
760 765 770 

atg aat gaa gaa gat get ata aaa eta get gaa ggc ctg aaa aac ctg 2405 
Met Asn Glu Glu Asp Ala lie Lys Leu Ala Glu Gly Leu Lys Asn Leu 
775 780 785 790 

aag aag atg tgt tta ttt cat ttg acc cac ttg tct gac att gga gag 2453 
Lys Lys Met Cys Leu Phe His Leu Thr His Leu Ser Asp lie Gly Glu 
795 800 805 

gga atg gat tac ata gtc aag tct ctg tea agt gaa ccc tgt gac ctt 2501 
Gly Met Asp Tyr lie Val Lys Ser Leu Ser Ser Glu Pro Cys Asp Leu 
810 815 820 

gaa gaa att caa tta gtc tec tgc tgc ttg tct gca aat gca gtg aaa 2549 
Glu Glu lie Gin Leu Val Ser Cys Cys Leu Ser Ala Asn Ala Val Lys 
825 830 835 

ate eta get cag aat ctt cac aat ttg gtc aaa ctg age att ctt gat 2597 
lie Leu Ala Gin Asn Leu His Asn Leu Val Lys Leu Ser lie Leu Asp 
840 845 850 

tta tea gaa aat tac ctg gaa aaa gat gga aat gaa get ctt cat gaa 2645 
Leu Ser Glu Asn Tyr Leu Glu Lys Asp Gly Asn Glu Ala Leu His Glu 
855 860 865 870 

ctg ate gac agg atg aac gtg eta gaa cag etc acc gca ctg atg ctg 2693 
Leu lie Asp Arg Met Asn Val Leu Glu Gin Leu Thr Ala Leu Met Leu 
875 880 885 



Fi 3 .ID 



ccc tgg ggc tgt gac gtg caa ggc age ctg age age ctg ttg aaa cat 
Pro Trp Gly Cys Asp Val Gin Gly Ser Leu Ser Ser Leu Leu Lys His 
890 895 900 



2741 



ttg gag gag gtc cca caa etc gtc aag ctt ggg ttg aaa aac tgg aga 2789 
Leu Glu Glu Val Pro Gin Leu Val Lys Leu Gly Leu Lys Asn Trp Arg 
905 910 915 

etc aca gat aca gag att aga att tta ggt gca ttt ttt gga aag aac 2837 
Leu Thr Asp Thr Glu lie Arg lie Leu Gly Ala Phe Phe Gly Lys Asn 
920 925 930 

cct ctg aaa aac ttc cag cag ttg aat ttg gcg gga aat cgt gtg age 2885 
Pro Leu Lys Asn Phe Gin Gin Leu Asn Leu Ala Gly Asn Arg Val Ser 
935 940 945 950 

agt gat gga tgg ctt gee ttc atg ggt gta ttt gag aat ctt aag caa 2533 
Ser Asp Gly Trp Leu Ala Phe Met Gly Val Phe Glu Asn Leu Lys Gin 
955 960 965 

tta gtg ttt ttt gac ttt agt act aaa gaa ttt eta cct gat cca gca 2981 
Leu Val Phe Phe Asp Phe Ser Thr Lys Glu Phe Leu Pro Asp Pro Ala 
970 975 980 

tta gtc aga aaa ctt age caa gtg tta tec aag tta act ttt ctg caa 3029 
Leu Val Arg Lys. Leu Ser Gin Val Leu Ser Lys Leu Thr Phe Leu Gin 
985 990 995 

gaa get agg ctt gtt ggg tgg caa ttt gat gat gat gat etc agt gtt 3077 
Glu Ala Arg Leu Val Gly Trp Gin Phe Asp Asp Asp Asp Leu Ser Val 
1000 1005 1010 

att aca ggt get ttt aaa eta gta act get taa ataaagtgta ctcgaagcca 3130 
lie Thr Gly Ala Phe Lys Leu Val Thr Ala * 
1015 1020 

gta , 3133 



Fij.it 



1 ATGCTGAACGCTGGTCCCCTGGGCTCCCTTATTTCTTTCTCTATACTTTGTCTC 

TACGACTTGCGACCAGGGGACCCGAGGGAATAAAGAAAGAGATATGAAACAGAGACACAGAAAAAGAAAA 
. 1 J > _^ 1 . _ L . _N.A G P L G S L I S F S I L C L C V F F F 



7 1 CCAAGTCTCTCGTTCCACCTAACGAGAAACACCCACAGA^ 
GGTTC^GAGAGCAAGGTGGATTGCTCTTTGTGGGTGTCITC 
_ _24j>_S_ _K _ S_ _L_ V _ P_ _P_ _N _ E_ _K_ _H _ P__Q__N_ K_ _K _ V _ S_ _G _ L _ Q_ _E L E 

141 GGCCTCACTGAAACGGAAAGCAAATACAAAGAAA<^^ 
CCGGAGTGACTTTGCCTTTCGTTTATGTTTCTTTC 
_ -IT?. A . S_ A _ K _ _R_ _K_ A _ _N_ _T _ K_ _K_ _L _ Y_ _F_ _K _ N_ _M_ _S _ W_ _S_ _P K K 

2 11 AGGGCAATTGGATTGCTCAGCCAGAGACCCT^ 

TCCCGTTAACCTAACGAGTCGGTCTCTGGGAACGTCCGTCTGTGTGTTCG^ 
. .I 1 ?- R _ A _ L _ 9. _ L _ A _ S_ _Q _ R _ P_ _L_ _Q_ A_ _D _T__Q_ _A_ A _ _G_ _R _ R_ E E 

281 ACATCGGCGGAAGAACATACAAGCAGCTGGACGTCCAGAGGACGTTGAAGGGAGAATCC^^ 
TGTAGCCGCCTTCTTCTATGTTCGTCGACCTGCAGGTCTCCTGCAACTTC 
^^A.L.G.G. _ R TYKQL DVQRTL KGECWRKS 



^351 CACACAACAGAC^TCGGCACGCCAGCAGGCCATCC^^ 
M, i GTGTGTTGTCTGTAGCCGTGCGGTCGTCCGGTAGGTGGTCTCCTTGCTGAGC 

.¥17?. .1.9 QTSARQQA I HQRNDSEFG lTe V 



: : 



$21 AATTTCATAAAGGACAATAGCCGAGCCCTTATTCAAAGAATGGGAATGACTGTTATAAAG^ 

TTAAAGTATTTCCTGTTATCGGCTCGGGAATAAGTTTCTTACCCTTACTGACAAT^ 
.1.41 N__F__I__K D N S R A L I QRMGMTVI KQI T 



391 ATGACCTATTTGTATGGAATGTTCTGAATCGCGAAGAAGTAAACATCATTTGCTC 
g \ TACTGGATAAACATACCTTACAAGACTTAGCGCTTCTTCATTTGTAGTA 
. li 4 .^- 0 - _D _ L _ _F_ _V _ W_ _N_ _V _ L _ _N_ _R _ E_ _E_ _V _ N_ _l_ I C C E K V E Q 

561 GGATGCTGCTAGAGGGATCATTCACATGATTTTGAAAAAGGGTTC^ 
CCTACGACGATCTCCCTAGTAAGTGTACTAAAACTT^ 
. i?.7j>_ _D_ A_ _A_ _R_ G_ _l_ _ I _ H_ _M _ l_ _ _L_ _K _ K_ _G_ _S _ E_ _S_ _C _ N_ _L_ _F L K 

631 TCCCTTAAGGAGTGGAACTATCCTCTATTTCAGGACTTC 

AGGGAATTCCTCACCTTGATAGGAGATAAAGTCCTGAACTTACCTGTTTC 
. 211?_ §_ _L_ _K__E_ W _N_ Y_ _P__L _ F_ _Q _ D _ L_ _N _G_ _Q_ _S j F_ E_ _E _T__Q_ _N _ _ 

701 GGGTCTTCTTTAACATCACCTCTTCTCTAATAGGTCTTTTTC 

CCCAGAAGAAATTGTAGTGGAGAAGAGATTATCCAGAAAAAGTAGTCTG^ 
. _V _ F_ _F_ _N _|_ T_ _S _ S_ _L_ _ l_ _ G_ _L_ _F _ _H_ _Q _T _ _S_ _E__G__D_ _L _ D _ _D_ 

771 TTTGGCTCAGGATTTAAAGGACTTGTACCATAC 

AAACCGAGTCCTAAATTTCCTGAACATGGTATGGGGTAGA 
257>_ _L. A_.Q _D_ L_ _K DLYHTPSFLNFYPLGED 



841 ATTGACATTATTTTTAACTTC 

TAACTGTAATAAAAATTGAACTTTTCGTGGAAGTC 
281?. j. Dl I FNLKSTFTEPVLWRKDQHH 



911 ACCGCGTGGAGCAGCTGACCC 

TGGCGCACCTCGTCGACTGGGACTTACCGGAGGACGTCCGAGAAGTC 
. 304J>H_ _R _ V_ _E_ _Q _ L. J_ _L _ N_ _G_ _L _ L_ _Q _A _ L_ _Q _S _ P_ JC _ l_ _ J _ eJTg_ E_ 

■ 981 ATCTGGCAAAGGCAAGTCCACTCTGCTGCAGCGAATTC 
TAGACCGTTTCCGTTCAGGTGAGAC^ 
. 327»_ _S _ G_ _K _G_ K. _T_ L_ l\ Q_ R_ J_ _A_ M_ _L _ W_ J3 _S _ G_ _K _C _ K_ A_ _ 

1051 CTGACC^GTTCAAATTCGTCTTCTTC 

GACTGGTTCAAGTTTAAGCAGAAGAAGGAGGCAGAGTCGTC 
_ 351>. t. J. _K _ f _ A _F _ V_ _F_ _F _ L_ _R _ L _ S_ _R _A _ Q_ _G _ G_ L_ _F _ E_ T_ _L _ _ 

1121 GTGATCAACTCCTGGATATACCTGGCACAJ\TCAGGAAGCAGAC 
CACTAGTTGAGGACCTATATGGACCGTGTTAGTCCTTCGTCTC 
374t>C D Q L L D I P G T_ _ l_ _ R_ _K_ _Q J F ^M£A M L L K L R 



J.1L91 GCAGAGGGTTCTTTTCCTTCTTGATG^ 

0 1 CGTCTCCCAAGAAAAGGAAGAACTACCGATGTTACTTAAGTTCGGGGTC 

. i;397j»_ _ Q _ _R V_ _ L _ F_ A - 1 . P. A - Y- it _E\ F _ K_ _P _ Q_ N_ _C _ P_ _E_ _ l_ _ E_ A_ _ 

|2 61 CTGATAAAGGAAAACCACCGCTTCAAGAACATGGTCATCGTCACC^ 
yl GACTATTTCCTTTTGGTGGCGAAGTTCTTGTA 

. tf |21j>_ L_ J_ _K _ E_ _N _H _ _R_ _F_ _K _ _N_ _M _ V _ J _ _V _ T _ T_ _T _ T _ E C L R H 

5331 TACGGCAGTTTGGTGCCCTGACTGCTGAGGTGG 

^; ATGCCGTCAAACCACGGGACTGACGACTCCACCCCCTATACTGTCTTCTGTC 

jj|44M_ _R _ Q_ F_ _G _ A_ L_ _T _ A_ E_ _V _ G_ _D _M _ J_ JE_ _D _ S_ A_ _Q_ A_ J-_ _ L _ R_ 

§401 AGAA.GTGCTGATCAAGGAGCTTGCTGAAGGCTTC 

TCTTCACGACTAGTTCCTCGAACGACTTCCGAACAACGAGGTT^ 

- 467j>_ _ § _ V_ _L. - L _ K. .E _ L _ A_ _E _ G_ L_ _L _ L_ Q_ J_ _ Q_ _S _ R_ J3 _ R^ _N ^ 

1471 CTCATGAAGACCCCTCTCTTTGTGGTCATC^ 

GAGTACTTCTGGGGAGAGAAACACCAGTAGTGAACACGTTAGGTCTACCCACT^ 

- 491?_ L_ W[ _ J_ _P_ _L _ F_ _V _V ^ J _ _T _C_ A_ .L .Q. R.GJ b S_E__F_ ^ _ 

1541 ACACAQW^CAACGCTGTTCCATACCTTCTATGATCTGTTC 

TGTGTGTTTGTTGCGACAAGGTATGGAAGATACTA 
. 514_^ H_ _T ^ Q_ T_ _T _ L_ _F_ _H _ T_ F_ _Y _ D_ _L _ J _ _Q _K ^ N_ _K _H _ K_ JH _K_ G_ 

1611 'TGTGGCTGCAAGTGACTTCATTCGGAGCCTC 

ACACCGACGTTCACTGAAGTAAGCCTCGGACCTGGTGACACCTC 

- 5 3 . 7 J > . - Y . A. A . § . P. « F „ i . A ^ L _ D_ _ C_ J3_ .D _ L. A. - F. JG _V _ F^ ^ _ 

1681 CACAAGTTTGATTTCGAACTGCA.GGA 

GTGTTC?iAACTAAAGCTTGACGTCCTACACAGGTC , 
.561^ HKFDFELQDVSSVNEDVLLTTGL 



1751 TCTGTAAATATACAGCTCAAA 

AGACATTTATATGTCGAGTTTCCAAGTTCGGTTTC 
-584j>L_ CKYTAQRFKPKYKFFHKSFQ E YT 



1821 AGCAGGACGAAGACTCAGCAGTTTATTGACGTCTCATGAGCCAGAGGAGGTGACCAA 

TCGTCCTGCTTCTGAGTCGTCAAATAACTGCAGAGTACTCGGTC 
-§°J?-A- P-JI J- .SSL LTSHEPEEVTKGNGY 



1891 TTGCAGAAAATGGTTTCCATTTCGGACATTACA 
AACGTCTTTTACCAAAGGTAAAGCCTGTA^ 
631^ LQKMVSI SDI TSTYSSL L RYTCG 



1961 CATCTGTGGAAGCCACCAGGGCTGTTATGAAGCACCTCG 

GTAGACACCTTCGGTGGTCCCGACAATACTTCGTGGAGCGTCG 
654&>S SVEATRA VMKHLAAVYQHGCLLG 



2031 ACTTTCCATCGCCAAGAGGCCTCTCTGGAGACAGGAATC 

TGAAAGGTAGCGGTTCTCCGGAGAGACCTCTGTCCTTAGAAACGTT^ 
§17?- _t _ _S_ J_ _A _ K_ _R _ P_ L_ \N _ R_ Q_ _E _ S_ L_ _Q _ S_ V_ _K _ N_ _T_ _T _ E_ JQ 

42101 GAAATTCTGAAAGCCATAAACATCAATTCCTTTC 

01 CTTTAAGACTTTCGGTATTTGTAGTTAAGGAAACATC 

^jlOl^ El LK A I Nl NSFVECGI HLYQEST 



^171 CCAAATCAGCCCTGAGCCAA^ * 
Ul GGTTTAGTCGGGACTCGGTTC^ 

im^S KSALSQEFEA FFQGKSL Y I NSGN 



^2 4 1 CATCCCCGATTACTTATTTGACTTCTT^ 

~; GTAGGGGCTAATGAATAAACTGAAGAAACTTGTAAACGGGT^ 

^Z47^-.L-f__D _Y_ L__F _D FFEHLPNCASALDFIK 

_ _ — _ ~ — "*""* _ — ~ — — — — — — — — — — — — — —«» — — — — — — — — — - 

SP 1 1 CTGGACTTTTATGGGGGAGCTATGGCTTCAIK^^ 

GACCTGAAAATACCCCCTCGATACCGAAGTACCCTTTTCCGACGTCTTCTGTGTCCACCTTAGGTG 
771> LDFYGGAMASWEKAAEDTGGI H M 



2381 AAGAGGCCCCAGAAACCTACATTCCCAGCAGJGGCTGTATCTTTGTTC^ 

TTCTCCGGGGTCTTTGGATGTAAGGGTCGTCCCGACATAGAAACA^^ 
_794^E EAPETYI PSRAVSLFFNWKQEFR 



2451 GACTCTGGAGGTCACACTCCGGGATTTC^GCAAGTTGAATAAGCAAGATATC^ 

CTGAGACCTCCAGTGTGAGGCCCTAAAGTCGTTCAACTTATTCGTTCTATAGTCTATAG^ 
817> TLEVTLRDFSKLNKQDI RYLGKI 



2521 TTCAGCTCTGCCACAAGCCTCAGGCTGCAAATAAAGAGATGTGCT^ 

AAGTCGAGACGGTGTTCGGAGTCCGACGTTTATTrcTCTACACGACCACACCGACCTTC 
_841» FSSATSLRLQI KRCAGVAGSLSL 



2591 TCCTCAGCACCTGTAAGAACATTTATTCTCTCATGGTGGAAGCCAGTCCCCTC 

AGGAGTCGTGGACATTCTTGTAAATAAGAGAGTACCACC^ 
_ 864>y L STCK N l_Y_S L _ M _V_ E _ A _S P L T I E DE R 



pig. 




2661 GC^CATCAC^TCTGTAAC\AACCTC 

CGTGTAGTGTAGACATTGTTTGGACT^ 
887l> HI TSVTNLKTLSI HDLQNQRLPG 



2731 GGTCTGACTGACAGCTTGGGTAACTTGAAGAACCTTACAAAGGTGA 
CCAGACTGACTGTCGAACCCATTGAACTTCTTGGAATGT^ 
911» GLTDSLGNL KNLTKL I MDN I KMN 



2801 AAGAAGATGCTATAAAACTAGCTGAAGGCCTGAAAAACCTGAAGAAG 

TTCTTCTACGATATTTTGATCGACTTCCGGACTT^ 
-21 4 ?A.^.P_ A. 1 KLAEGLKNLKKMCL FHLTH 



2871 CTTGTCTGACATTGGAGAGGGAATGGATTA 

GAACAGACTGTAACCTCTCCCTTACCTAATGTAT^ 
_ 957t>_L SDI GEGMDYI VKSLSSEPCDL. E 



2941 GAAATTCAATTAGTCTCCTGCTGCTTGTCTGCA 
CTTTAAGTTAATCAGAGGACGACGAACAGACG^ 
_28lj > _E_J _QLVSCCLSANAVKI LAQNLHN 



y£011 TGGTCAAACTGAGCATTCTTGATTTATCA 

0 1 ACCAGTTTGACTCGTAAGAACTAAATAGTCTTTTAATQ 

_ !<- A-L- _k .D _ L_ JS__E_ N_ _Y_ _L _ E_ _K _D _ G_ _N _E _ A_ JL_ _H_ E 

bj081 ACTGATCGACAGGATGAACGTGCTAGAACAGCTCACCGCACTGATGCT^ 
^ f TGACTAGCTGTCCTACTTXXIACGATCTTGTCGA 

tl027> LI DRMNVL EQLTA LML PWGCDVQ 



§3-51 GGCAGCCTGAGCAGCCTGTTGAAACATTTGGAGGAGGTC 

p5 CCGTCGGACTCGTCGGACAACTTTGTAAACCTCCTCCAGGGTGTTC 

i^51t> GSLSSLLKHLEEVPQLVKLGLKN 



Q'l21 GGAGACTCACAGATACAGAGATTAGAATTTTAGGTGCAT^ 

CCTCTGAGTGTCTATGTCTCTAATCTTAAAATCCACGTAAAA^ 
1074l>W RLTDTEI Rl LGAFFGKNPLKNFQ 



3291 GCAGTTGAATTTGGCGGGAAATCGTGTGAGCAGTC 

CGTCAACTTAAACCGCCCTTTAGCACACTCGTCACTACCTACCGAACGGA^ 
1097t> QL NLA GNRVSSDGWLA FMGVF EN 



3361 CTTAAGCAATTAGTGTTTTTTGACTTTA 

GAATTCGTTAATCACAAAAJ\ACTGAAATCAT^ 
1121i> L KQLVFFDFSTKEFL PDP A LVRK 



3431 TTAGCCAAGTGTTATCCAAGTTAACTTTTCTGC 
AATCGGTTCAGAATAGGTTCAATTGAAAAGAC^ 
1144>L SQVLSKLTFLQEARLVGWQFDDD 



3501 TGATCTCAGTGTTATTAGAGATGAGAAAGCTCAGATGATT^ 

ACTAGAGTCACAATAATGTCTACTCTTTC 
1167l> D L S _V_ J_ T _D E_ K A Q M _ I _ C _P W V_ l_ K L L PY 



Rg. 



3571 ACAGTGGCAGCATC^GAACTGGAATTCAGATCTCTTGCCTCCTAG 
TGTCACCGTCGTAGTCTTGACCTTAAGTCTAGAGAACGGAGGATC 
1191^ TVAASELEFRSLAS 
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4) 

m 

□ 

m 

4) 

O 
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Fig. AE: 
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■ Beta, Regions • Chou-Fasman 

■ Turn. Regions - Garnier-Robson 

■ Turn. Regions - Chou-Fasman 
□ Coil, Regions - Garnier-Robson 



I Hydrophilicity Plot - Kyte-Ooolittle 
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Q Flexible Regions - Karplus-Schulz 



H Antigenic Index - Jameson-Wolf 



□ Surface Probability Plot - Emini 
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CARD: domain 1 of 1, from 2 to 88: score 16.0, E = 0.0065 

D HO J 7 *->aeddrrllrknrlellgeltlsglLdhLleknvLteeeeEkikaJcnt 
+ + + + +g ++++-»• d+L nvL++ee+ i + 

CARD12 2 --NFIKDNSRALIQRMGMWIKQITDDLFVWNVLNREEVNIICCEKV 4 6 

trrdkareLiDsvqkkGnqAfqiFlqaLretdqelladlllde<-* f 
++ d ar i +++kkG++ +++F1 +L+e ++ 1 +dl + j 
CARD 12 47 EQ-DAARGIIHMILKKGSESCNLFLKSLKEWNYPLFQDLNGQS 88 



LRR: domain 1 of 4, 



CARD 12 



764 



LRR: domain 2 of 4, 



CARD 12 



821 



from 764 to 791: score 0.6, E ~ 8.2e+02 

* ->nLeeLdLsnN . Lt slppglf snLp<- * 

nL++L ++n + ♦++ +1 +g ♦♦nL+ 
NLTKLIMDNIkMNeedaiKLAEG-LKNLK 791 

from 821 to 848: score 0.3, E = 9.3e+02 
*->nLeeLdLsnN.Lt . . . . slppglf snLp<-* 
Lee* L ♦+ L+ + ++ ++nL 
DLEEIQLVSCcLSanavKILAQNLHNLV 848 



23 



LRR: domain 3 of 4, from 849 to 872: score 11.2, E 
i£0LXD MO; £ *->nLeeLdLsnN.LtslppglfsnLp<-'* 

+ L LdLs N L+ + +++++ L 
CARD12 849 KLSILDLSENyLEKDGNEALHELI 872 

LRR: domain 4 of 4, from 938 to 965: score 4.2, E = 2.5e+02 

SF&I-fl MS *->nLeeLdLsnN.Lt slppglf snLp<- * 

n + L+L +N+ ++++ + +f+nL+ 
CARD12 938 NFQQLNIAGNrVSsdgwlAFMG-VFENLK 965 



Fig, 5* C 
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1 
1 



L . 



L . S 



.EGE.G.GK 



L L 



I A . L W . S G Consensus #1 



EQLVLNGVLGALNSVC I VEGEAGSGKSVLLQKIAFLWGSG Majority 



10 



E Q 

La p 



LTLNGLLQA 
llJ vllJ PEVFGN 



P C I I 

LU nLsJ v m c v 



20 
J. 



E G EiS 



ejueIaLsis 



30 
i 



r 
40 



G K 



G K 



S T 
T V 



L L 



Q R 

K K 



I A 



L W 



S G 



C&FD12-C 



41 
41 



V F 



L S 



R . 



G L 



CDQLL 



G . 



Oocisaisus #1 



KCKALTKFOLVFFLSLSSTRADGGLASILCDQLLDIEGSV Majority 



K 



K A 



cLy p l 



L T K 



l_y n r[fj q l 



50 



K F 



V F 



y[l 



GO 
L, 



L S|- - 
SU^SJS T 



A Q G 
P D E 



G L 
SLL 



70 



F E T L 
A S I I 



CDQLL 
C P P L I, 



DIP 



80 



T I CARD12-C 



E K ElS| S V 



79 
81 



L . . . VLFLLD , Y , E C I . . L Consensus #1 

TEQTFRAILLQLKNQVLFLLDGYNEIKPQNCS IPQVIGAL Majority 

100 110 120 



90 

_JL- 



RKQTFMAMLLK 
TEMCMRNI I Q Q 



R Q R 
KNQ 



VLFLLD 
VLFLLD 



Y N 



FKPQN 



N H 



DLtf KLEII - 
. IR , . . 



P E 



[£|S I PQ VUJG KlLj NAHM: 



E A 



CARD12-C 



□ IQENHLSKTCVLVAVTTERARDIRQFGALIAEVGAFTEDS Majority 



lis' IiIk e|n h 



130 
l_ 



RFKNMVIVTTT 



MJ2L S RT C L L I A VRLtlNR A 



140 
L 



150 



E Consensus #1 

S : 

160 



E C L 



IRQFGALTA 



VGDMTEDS CARD12-C 



R Y L E T I Llfill QAFPFYN NAIP-C 



t\ R K. . .L KTPLPV. Consensus #1 

K\ AVALLREVLIKELAELRGLLVQIGKSQSL^NLQKTPLFVA Majority 

= 170 180 ISO 200 

1 L 



1S6 5 A Q A L I 



REVL IKELAE - - GLLLQIQ 



isR tvcilIbiklfshnmtrlrkfmvyf g1jc|n q sIiJq K I Q 



SRC 



RNLM 



KTPLFVV CAREL2-C 
K T P L F VI A NAIP-C 



yl - • C A F F L . . . NK Consensus #1 

CI AICAIQWGESEFDSSFTDVAVFK S'F YDLL ILKNKHKHKGV Majority 

i 1 r J * 

210 220 230 240 



194 I T 
197 A I 



J. 



CAIQMGESE 



HSH-TQTTL 



QJi "HWFQY PlZID P S F D D V A VLe|K SYME R\li S L R 



H T F Y D L 



I Q K 



N K 
ILK 



HKHKGV CARD12-C 
NAIP-C 



233 
230 



A CG.LAL.G.FS. .F.F, . ,D V.E Consensus #1 

AAADILKATVS SCGDLALEGVFSHKFDFELDDVAE AGVDE Majority 
1 r- 1 r- 

250 260 270 280 
i , ~- 1, , , l , i .I, 



aasdfirs-ldh 
[aJtaeilkatvss 



C G D 
£J3E 



L A L E 
L^lJKlfilF 



F S 



H K 



E L Q 



C ClZlELEIN D DlDjL A E A G 



V S - - S 



v[n 



CAREXL2-C 



D[Ej NAIP-C 



D • .LT. . L . . K . TAQR . .P.Y.F. . . . PQE . . A G . R L . . L Consensus #1 
DVLLTTGLLSKFTAQRLKPKYKFLSKAFQEFLAGRRLISL Majority 

i * r-= i r- 



270 



270 [DIE D 



D V L 



L T T G 
L TlM C 



290 
J. 



xJl c|k 
iJm sIk 



K Y 
K F 



TAQRFK 



T A Q Rl 



L R 



300 
i 



P K 



Y K 



LEI FUflRlEI L SPA 



F F H K S 



310 
L 



FQEYT 



FOE 



F L 



A G 



R L 



320 
J. 



S S 



CARD12-C 



I ELU NAIP-C 
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..E....G...L....S S • Y . . - L . Y Consensus #1 

LTSDEQEEVTLGLGHLQQIVSISDIVSAYSSLLNYVSGLS Majority 
1 i i i 

330 340 350 360 



310 
310 



L tsIhe 
jj d|_si d r 



E V T KIG 



N G Y 



QKMV 



QLElH Q D LlfilL Y H[L| K Q I NLSJ P MMT VlfijA 



I S D I T 



-L 



SSL 



T C G - S CAKEL2-C 



Y]N N FlLiNud V SSLP NAIP-C 

. L . . Q Consensus #1 



349 



S . . A HL L . . . S . . . 

SVEAGRAVVSHLAAVVDNKGSLLGLSIADDYLKHQESISL Majority 

1 I- 55 1 r 

370 380 390 400 

I ^ L 



V E 



ATRAVMK 



350 LSI T KLAj G P K I V S 



H L 
ILL 



AAVYQH-GC 



LGL 



IAKRP 



W R 



E S | CBHDE12-C 



LHLVDNKE SLlIE N l[£]E N D D YlljK HLQj P Eisi NAIF-C 



L Q V Y Q . . T . . Consensus #1 

QMQLLQGVKNIT-EQAILSAVSINLLVLAGITAYQSSTVAA Majority 

430 440 



385 



T 

410 
1— 



LQSVKNTTE 



390 QMQ LLU RGLWQ I C P|_Q| AYFSMVSEHLL 



420 
I 



EILKAININSF 



ECGIHL 
LALKTA 



YQ 



E S 



SKS CAREL2-C 



s nLeJv a a naip-c 



QG 



N 



F F . H 



. S . L Consensus #1 



CI ALSQVLEAFLQGKSLTLGAGNLPDYLFDFFDHLPESASAL Majority 
« H 6 r-^ 1 r 



421 ALSQ 
430 C S P F 



E F E A| F| 

V L Q -La 



450 
i 



QG 



K S 



460 

L 



Y I N S GN 



R TlL| T L G A LIU) L - QLXl 



I P D 



LFD 



470 

L 



F F 
HE 



E 



H 



DlHl- 



N C A 



480 

i_ 



l£|E S LlSlLlLJ NAIP-C 



CAREXL2-C 



I G . . . . . P Y..I OODSRTfais #1 



7% DSIKLSIRGGATASRAKAAVLTGGIDKSEAPTIDETYIPA Majority 

490 500 510 520 
J I U-^ p_ L 



411 
464 



L DFIK 
{ RSIIH 



L D F Y 



GGAMASWEKAAEDTGGIHMEEA 



F S I RLS NKTSPRAHFSVLETCFDKSQV 



E T 



IPS GAHD12-C 



Pj T I D Q DlH- - A NAIP-C 

A W E L ]• Consensus #1 

I SAVSLFNEWEQELATLEVTVKSF S'D LNKQAITDLGTGFSS Majority 



498 R 



V S L F F N 



502 SlAlF E PMNE 



530 
J 



KQEFRTL 



540 
I 



550 
t 



VTLRDFSKLNKQDITY 



ERNLAE KlH DNVK S YMDMQ RR A S P D|L] S T G Y W K NAIP-C 



560 



GKIFSS CARD12-C 



I 



V 



Ccnsensus #1 



ASSLQLQIKRCAGVAGSLSLVLSTCKNIYSLEVDASDLTV Majority 
a S 1 H 1 1 "T 

570 580 590 6b0 

-J 1. ■ , ■ - 5L 

LlM Vj E A S 



538 ATSLRLQ 
542 LSPKQYK 



IKR 



a - plq 



iL. 



P L T £L CARD12-C 
D I D iV NAIP-C 



AGVAGSLSLVLSTCKNIYS 

S qr . , . , L , . S Consensus #1 

VGEDHLTIVTNLTVLSIHDLASQRLEGGLTDSLGNLKGLI Majority 

1 — * ! r- 



610 
J- 



620 



630 



578 
562 



EDERHITSVTNLKT L S| I 
VGQDMLEIL - - MTV fLsJ - 



HDL QN 
AS 



Q R L P G G 
o rI IELH! 



ST D 
N H 



640 

L. 



SL GNLKNLT CTRD12-C 
Si RGFI NAIP-C 
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618 K L 



" * 1 K C L S . . . . Gcnsensus #1 

ELIRDALELSE ASAIKLAEGLKNLKKMCLISLLELSAAGE NfcnriH^ 

650 660 670 680 

J „ L 



MDNIKMNEEDAI 



592 ESlilRPALELSKAS VTIk 



K 



LAEGLKNLKKM 



SL F 
S I 



HLTH 
SKLE 



L S 



D I G E CARD12-C 
A A E Q NAIP-C 



S L . S . . . . L E Q...N Ooasensus #1 

GLLLIVKSLSS EPCDLEEI(j)LVSCCLVAGAVQILAQILHN MaHaritv 



658 GMDYIVK 
621 ELLLTLP 



1 —t 

690 700 710 720 

J m _L 



S L S 
IS LI ELS 



E P C D 



L E 



EIQLVSCCLSANAVKILA 
VSGTIQSQD 



N L H 



[fill F PlU 



N 



CAREXL2-C 
NAIP-C 



L . K . . . L . L . E GN I L. Cfcnsesus #1 

LVKLS ILDLSE LSVDLDGNIAVHSVIPDEFNVLEQLTALL Ma-iorifcv 

*_ 1 i i 

750 160 



698 
648 



LSI 



|_L| DLH F - -[L 



730 
J. 



L S 

llIk 



E N Y L E K D 
IE) L S V D L E 



740 
a. 



GNEALHEL 



I N V F S V 



-DRMNVLEQLTA 



Hjpeefphfhhme kIlI l naip-c 



M CARD12-C 



S.L. . . L L..L.L D . E . . fVtngiaricaug #1 

SJ LQIGADVDGSL SSLVASLEEVISLVILGLEGQQLTDTEIS Manarit^ 
0 770 780 790 800 



73$ LPWGCD VQGS L|S 
IQISAEYDP - - 



L K H 
KUJ V A S 



L E E V P Q 



ttJPNFI sLlIk iIl 



V K 



K N W R L T 
E G Q Q F P 



dJt! 



E| I R CARD12-C 
T S NAIP-C 



L.N...L.L G K Omflpnynis #1 

n ILGAF IGLGSLSNLEELILAGGDVSSDGWLAFMGVFEVAK Majority 



777= I L G 



810 



820 



A F F G K N P 



724, E K FLa| Y I - L G SLL 



K 



NFQQ 
\M L E E 



N 



830 



AGNRVSSDGWLAFM 



IUJ P T G D 



840 



V F E N L 



I Y R V A|i£] miB-C 



CARDGL2-C 



m L . . . L . . L . F Consensus #1 

!;iLVFFDFSTK EFLPDPALVQQLSQVLSVLSFLQTARLVGW Majority 

1 ■ I I 

860 870 880 



850 



817 QLVFFDFSTKEFLPDPALVRK 



753 L I I 



QQC Q QlU 



S Q V 



H ClLIR V 



S K 



SlFj 



. . .DD. . . . I 

QLDDDSVVVITGAFKLVTG 
T 

890 
1 



LQEARLVGW OTRD12-C 
FKT NAIP-C 

Ocnsfrisus #1 
Majority 



857 Q F D 
773 - L N 



D D D L S V 



S VV ELII 



TGAFKLVTA 
G 



CAHD12-C 
NAIP-C 



Consensus 'Consensus #1« ; When all match the residue of CARD12-C show the residue of 
CBKD12-C, otherwise show 



Decoration 'Decoration #1' : Box residues that match the consensus named 'Consensus #1' 
exactly. 



